Analysis of the Effect of Distance Metric across Languages on Verse Similarity in the Qur'an
نویسندگان
چکیده
Text similarity measures have been widely studied and used in machine learning and information retrieval for many years. However, few applications of text similarity have dealt with multi-lingual translations of a specific document. Additionally, the growing number of texts with more translations being generated increases the challenge of distinguishing or identifying the similarity and differences between texts across different documents. In this article, we employ different text similarity measures to delve into the problem of text similarity in the context of multi-lingual representations of the Qur'an. Four semantic translations of the Qur'an are used for comparative study and analysis. We compare and contrast the effect of applying five similarity measures across these representations. We analyze the results along two classes namely: identical verse pairs and similar verse pairs. Our analysis provides helpful observations about the impact of the five distance metrics for verse similarity in the Qur'an across different languages.
منابع مشابه
یادگیری نیمه نظارتی کرنل مرکب با استفاده از تکنیکهای یادگیری معیار فاصله
Distance metric has a key role in many machine learning and computer vision algorithms so that choosing an appropriate distance metric has a direct effect on the performance of such algorithms. Recently, distance metric learning using labeled data or other available supervisory information has become a very active research area in machine learning applications. Studies in this area have shown t...
متن کاملComposite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملAn Effective Approach for Robust Metric Learning in the Presence of Label Noise
Many algorithms in machine learning, pattern recognition, and data mining are based on a similarity/distance measure. For example, the kNN classifier and clustering algorithms such as k-means require a similarity/distance function. Also, in Content-Based Information Retrieval (CBIR) systems, we need to rank the retrieved objects based on the similarity to the query. As generic measures such as ...
متن کاملMonotheism in Help-Asking and its Educational Effects
From the point of view of the Holy Quran, monotheism as the most basic pillar of faith has different aspects of which the most important one is monotheism in recourse. Tawhid in help-asking along with Tawhid in worship is one of the important topics in the field of Qur'anic research on which various verses have been revealed and studied by commentators. In this article, after giving a terminolo...
متن کاملComparative Study of Verse Similarity for Multi-lingual Representations of the Qur’an
Text similarity is a subject that has received great attention in recent years. However, the application of text similarity tools to Semitic languages such as Arabic faces unique challenges. Moreover, the increasing number of texts being made available online, not only in native languages but also in translation, adds further challenge to identifying similar portions of texts across different d...
متن کامل